Abstract

Futures trading is the core of the futures business and is regarded as a typical complex
system. To investigate the complexity of futures trading, we employ the analytical method of
complex networks. First, we use real trading records from the Shanghai Futures Exchange to 
construct futures trading networks, in which nodes are trading participants, and two nodes have 
a common edge if the two corresponding investors appear simultaneously in at least one trading 
record as a purchaser and a seller respectively. Then, we conduct a comprehensive statistical 
analysis on the constructed futures trading networks. Empirical results show that the futures 
trading networks exhibit features such as scale-free behavior with interesting odd-even-degree 
divergence in low-degree regions, small-world effect, hierarchical organization, power-law be- 
tweenness distribution, disassortative mixing, and shrinkage of both the average path length and 
the diameter as network size increases. To the best of our knowledge, this is the first work that 
uses real data to study futures trading networks, and we argue that the research results can shed 
light on the nature of real futures business. 

Keywords: Complex networks, Futures trading networks, Scale-free scaling, Small-world effect
PACS: 89.75.Fb, 89.75.Hc, 89.65.Gh 



1. Introduction 

Since the works of Watts & Strogatz [1] and Barabási & Albert [2] were published, complex
networks, as a new scientific area, have attracted a tremendous amount of research interest [3-7].
Complex networks can describe a wide range of real-life systems in nature and society,
and show various nontrivial topological characteristics not occurring in simple networks such 
as regular lattices and random networks. There are a number of frequently cited examples that 
have been studied from the perspective of complex networks, including the World Wide Web [8-11],
the Internet [12], metabolic networks [13], scientific collaboration networks [14, 15], online
social networks [16, 17], public transport networks [18, 19], airline flight networks [15, 20-23] 
and human language networks [24-26]. Empirical studies of these networks
have largely motivated the recent interest in this new research focus, so that a
number of techniques and models have been explored to improve the understanding of the topology
and evolution of real complex systems [27-34]. With its growing importance and popularity,
complex network theory has become a powerful tool with intuitive and effective representations
to analyze complex systems in a variety of fields, including financial markets [35].

In the literature, a number of papers have been dedicated to studying financial markets from 
the perspective of complex networks. The major difference among these works lies in the types 
of networks to be constructed from financial data for characterizing the organization and struc- 
ture of financial markets. Some existing works constructed stock networks whose connectivity is 
defined by the correlation between any two time series of stock prices [36-40]. Some others es- 
tablished directed networks of stock ownership describing the relationship between stockholders 
and companies [41, 42]. Networks of market investors based on transaction interactions between 
the investors were also investigated. For example, Franke et al. [43] analyzed irregular trading 
behaviors of users in an experimental stock market, while Wang et al. [44] studied the evolving 
topology of such a network in an experimental futures exchange. Recently, Jiang et al. [45] have
investigated stock weighted directed trading networks based on daily transaction records through 
reconstructing the limit order book with real order series from the Shenzhen Stock Exchange. 

The study of financial investor networks can provide clues for revealing the true complexity of
financial markets, especially futures markets. In a real futures market, the futures trading model
serves as a matching engine for executing all eligible orders from various market participants,
and the interactions among the participants form a complex exchange network, which is termed
the futures trading network (FTN for short) in this paper. Simply put, an FTN consists of a set
of trading participants, each of which has at least one direct exchange connection to
another via futures contract(s).

In this paper, we try to provide a comprehensive study on the characteristics of futures trading 
networks established with genuine trading data from the Shanghai Futures Exchange. To the 
best of our knowledge, this is the first work that uses real futures trading data for constructing 
networks. We therefore expect that the empirical results may, to some degree, unveil futures trading
behaviors and shed light on the nature of futures markets.

The rest of this paper is organized as follows. The FTN construction method is introduced in
Section 2, including the trading dataset and the details of network construction. Section 3 presents
the empirical results of FTNs. Section 4 concludes the paper and highlights some future work 
directions. 

2. Construction of Futures Trading Networks

In this section, we first introduce the real trading data from the Shanghai Futures Exchange, 
which will be used for constructing futures trading networks, and then present the details of the 
construction of futures trading networks. 

2.1. Dataset 

For constructing the futures trading networks, we use real trading records from the Shanghai 
Futures Exchange, which is the largest one in China's domestic futures market and has consid- 
erable impact on the global derivative market. Trade records are generated by matching orders 
or quotes from buyers and sellers according to a certain rule of price/time priority (first price, 
then time) via the electronic trading platform of the futures exchange. There are hundreds of 
thousands of matching results reported by the exchange on a typical trading day. We use a
dataset involving all futures commodities in the derivative market from July to September 2008.
In the trading records, virtual and unique IDs represent the trading participants,
and all other sensitive information is filtered out to preserve privacy.

2.2. Network Construction 

Since a trading record contains the IDs of both the buyer and the seller, we are able to establish 
a futures trading network. The construction process is as follows. A node represents a participant 
in a trading record, and an edge, meaning a trade relation, is established between two participants
if their IDs appear together in at least one trading record. An example of an FTN that
comprises nine records is shown in Fig. 1. Here, A-H are IDs of trading participants. Each row
in the right table of Fig. 1 represents a trading record, where the seller and buyer columns are the
IDs of the seller and buyer, respectively.

Fig. 1. An example of the futures trading network (left) constructed from a dataset with nine trading records
(right).
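
The construction rule described above can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' implementation: the record layout (a pair of seller ID and buyer ID), the function name build_ftn, the handling of self-trades, and the nine sample records are all assumptions made for the example, and the records are not those of Fig. 1.

```python
from collections import defaultdict

def build_ftn(records):
    """Build an undirected, unweighted trading network.

    records: iterable of (seller_id, buyer_id) pairs, one per trading record.
    Returns a dict mapping each participant ID to the set of its neighbors.
    """
    adj = defaultdict(set)
    for seller, buyer in records:
        if seller == buyer:
            continue  # assumption: self-trades are ignored (no self-loops)
        # Repeated trades between the same pair still yield a single edge.
        adj[seller].add(buyer)
        adj[buyer].add(seller)
    return dict(adj)

# Hypothetical records (seller, buyer); not the data shown in Fig. 1.
records = [("A", "B"), ("B", "C"), ("A", "B"), ("C", "D"), ("E", "A"),
           ("F", "G"), ("G", "H"), ("H", "A"), ("D", "A")]
ftn = build_ftn(records)
num_nodes = len(ftn)
num_edges = sum(len(nbrs) for nbrs in ftn.values()) // 2
print(num_nodes, num_edges)
```

Storing neighbors in sets makes the network unweighted by construction, which matches the rule that one shared record is enough to create an edge.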

Following the above process, we first create the largest network with the whole dataset contain- 
ing three months' trading records; this network is denoted as FTN-all. Then, we construct three 
sub-networks based on three subsets of the whole dataset above according to the futures com- 
modity classification of the Shanghai Futures Exchange. These three sub-networks are termed 
as FTN-met, FTN-rub and FTN-oil, which involve only the futures commodities metal (specifically
copper, aluminum, zinc and gold), natural rubber and fuel oil, respectively. Thus, we
have four FTNs in total; their statistics are given in Table 1.



3. Properties of Futures Trading Networks 

In what follows, we study the characteristics of the FTNs constructed above, focusing on
topological features and dynamical properties. Extensive empirical results are presented.

3.1. Scale-free Behavior

The degree distribution P(k) is one of the most important statistical characteristics of a net- 
work, and also one of the simplest properties that can be measured directly. It is defined as the 
probability that a random node in a network has exactly k edges. In many real complex networks,
P(k) decays with k as a power law, following

$P(k) \sim k^{-\lambda}$ (1)

A network that has such a property is called a scale-free network [2]. We plot the degree
distributions P(k) of FTN-all, FTN-met, FTN-rub and FTN-oil in Fig. 2(a)-(d). Visual inspection
of the subplots suggests that all four FTNs are scale-free networks. However, reaching the
conclusion that all FTNs are scale-free requires a deeper analysis with statistical techniques.

To detect the existence of the power-law property in FTNs, we consider the cumulative degree
distribution function defined as $P_{cum}(k) = \Pr(k' > k)$, because a power-law cumulative degree
distribution of a network means that its degree distribution obeys a power law [46]. Following
Ref. [47], we calculate the complementary cumulative degree distributions of the FTNs to estimate
the scaling parameter $\lambda$ and the lower bound $k_{min}$ of the power-law behavior, using
maximum likelihood estimation and the quantification of the distance between two probability
distributions. Furthermore, we conduct a goodness-of-fit test by computing the distance between
the empirical distribution of each FTN and the power-law model with the Kolmogorov-Smirnov (KS)
statistic, and generating a p-value that quantifies the plausibility of the power-law degree
distribution hypothesis for each FTN. The cumulative degree distributions $P_{cum}(k)$ of all four
networks are presented in Fig. 3 with the slope $(\lambda - 1)$. Table 2 gives the estimates of $\lambda$, $k_{min}$,
and the p-value of the goodness-of-fit test for each power-law fit.
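
The fitting step can be illustrated with a simplified sketch. It is not the full procedure of Ref. [47]: the lower bound is taken as given instead of being selected by minimizing the KS distance, the continuous maximum likelihood estimator is used as an approximation for integer degrees, and no bootstrap p-value is computed. The function name fit_power_law and the synthetic sample are assumptions made for illustration.

```python
import math
import random

def fit_power_law(samples, x_min):
    """Continuous MLE of the scaling parameter for the tail x >= x_min,
    plus the KS distance between the empirical and fitted CDFs."""
    tail = sorted(x for x in samples if x >= x_min)
    n = len(tail)
    # Continuous maximum likelihood estimator: 1 + n / sum(ln(x_i / x_min)).
    lam = 1.0 + n / sum(math.log(x / x_min) for x in tail)
    ks = 0.0
    for i, x in enumerate(tail):
        model_cdf = 1.0 - (x / x_min) ** (1.0 - lam)
        # Compare the model CDF with the empirical step function on both sides.
        ks = max(ks, abs(model_cdf - i / n), abs(model_cdf - (i + 1) / n))
    return lam, ks

# Synthetic check: draw from a continuous power law by inverse transform.
rng = random.Random(0)
true_lam, x_min = 2.5, 10.0
samples = [x_min * (1.0 - rng.random()) ** (-1.0 / (true_lam - 1.0))
           for _ in range(20000)]
lam_hat, ks = fit_power_law(samples, x_min)
print(round(lam_hat, 2), round(ks, 4))
```

In the full method, this fit would be repeated for each candidate lower bound, keeping the one with the smallest KS distance, and the p-value would come from refitting many synthetic datasets.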

As listed in Table 2, the scaling parameters $\lambda$ of the four FTNs have similar values
of 2.48, 2.39, 2.53 and 2.39 for FTN-all, FTN-met, FTN-rub and FTN-oil, respectively. Their



Table 1. Statistics of the four FTNs. FTN-all is based on the whole dataset containing
three months' trading records from July to September 2008. FTN-met, FTN-rub and FTN-oil are three
sub-networks of FTN-all that correspond to three subsets that involve only trading records of the futures
commodities metal, natural rubber and fuel oil, respectively.

Network    Futures commodity    Number of nodes    Number of edges
FTN-all    all commodities      100994             8068676
FTN-met    metal                75262              3226119
FTN-rub    natural rubber       55828              3364557
FTN-oil    fuel oil             52208              1657268



Table 2. Estimates of the lower bound $k_{min}$ and the scaling parameter $\lambda$, and the goodness-of-fit
tests of power-law behavior for the four futures trading networks. The p-values indicate that the degree
distribution of each network is consistent with a power-law hypothesis.

Network    $k_{min}$    $\lambda$    p-value
FTN-all    926          2.48         0.225
FTN-met    339          2.39         0.106
FTN-rub    1196         2.53         0.591
FTN-oil    351          2.39         0.674

Fig. 2. The degree distributions P(k) of the four futures trading networks FTN-all, FTN-met, FTN-rub and
FTN-oil. The four fitting lines of the subplots have similar slopes, and the scaling parameters $\lambda$
are 2.48, 2.39, 2.53 and 2.39, respectively.

p-values are 0.225, 0.106, 0.591 and 0.674, respectively, which provide sufficient evidence
that the power law is a plausible hypothesis for the degree distribution of each FTN. These
results imply that highly connected nodes occur with non-negligible probability and dominate
the connectivity of FTNs. These nodes correspond to a few active speculators who issue
numerous orders to the exchange and consequently have more opportunities to make deals with
others. Their trading behaviors also account for the dynamic expansion of FTNs in
accordance with the rule of preferential attachment, whereby new nodes are continuously added during the
lifetime of a network.

In Fig. 2, we can also see that in low-degree regions, the probability of an even degree
($k = 2n$) is larger than that of an odd degree ($k = 2n + 1$), which forms a divergence. As the degree
increases, the two branches gradually converge and the difference is averaged
out. The even-degree branch conforms to the power law better than the odd-degree one. One
reasonable explanation for this observation is that most participants of low degree, entering the
futures market as speculators, cautiously make a few transactions; but worrying about risk or
loss, they do not hold their positions long, that is, they open futures positions (buying/selling) and
close them soon (selling/buying). Certainly, there are some newcomers who are market investors
and prefer holding positions for long-term profits. To a certain extent, due to the randomness of
counterparty matching during trade execution, the two matches (open and close) for a speculator in

Fig. 3. The cumulative degree distributions $P_{cum}(k)$ and their maximum likelihood power-law fits for FTN-all,
FTN-met, FTN-rub and FTN-oil. The slopes of the fitting lines of the subplots are similar,
with values of 1.48, 1.39, 1.53 and 1.39, respectively.



a complete open-close cycle very likely involve different counterparties; thus two degrees
(edges) will be added to the corresponding node. This observation indicates that there are few
hedgers with long-term positions in real trading, which is consistent with the low delivery
ratio in the futures market.

Now we consider weighted futures trading networks, in which our focus is the
relationship between the strength and the degree of nodes. The strength-degree relation in air transportation
networks [15, 20-23] follows a power law, and various models [48-52] have been proposed to
explain the origin of the power-law correlation. In this paper, the node strength is defined as
follows. The weight of an edge between two nodes is the number of times they appear
in the same trade record, and the strength of a node is the total weight of all edges
incident to it. Fig. 4 shows the relationship between the average strength S of the nodes with degree
k and k in the four FTNs, which indicates that a nontrivial power-law scaling $S \sim k^{\beta}$ ($\beta \approx 1.12$
for all four networks) exists, and demonstrates that active market participants get
more active. Such scale-free behavior, being correlated with the occurrence frequency of a trading
participant, provides another justification for the power-law degree distribution of the networks.

Fig. 4. Node strength S versus degree k for FTN-all, FTN-met, FTN-rub and FTN-oil. Here, the slopes are
all close to 1.12 ± 0.01.
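
As a hedged illustration of the strength definition, the sketch below computes edge weights, node strengths and the average strength of nodes sharing the same degree from a list of trading records. The (seller, buyer) record layout, the function name strength_by_degree and the three sample records are hypothetical.

```python
from collections import Counter, defaultdict

def strength_by_degree(records):
    """Average node strength S(k) for each degree k.

    records: iterable of (seller_id, buyer_id) pairs, one per trading record.
    """
    # Edge weight = number of records in which the two participants co-occur.
    weight = Counter()
    for seller, buyer in records:
        if seller != buyer:
            weight[frozenset((seller, buyer))] += 1
    degree = Counter()
    strength = Counter()
    for pair, w in weight.items():
        for node in pair:
            degree[node] += 1      # one more distinct counterparty
            strength[node] += w    # total weight of incident edges
    groups = defaultdict(list)
    for node, k in degree.items():
        groups[k].append(strength[node])
    return {k: sum(vals) / len(vals) for k, vals in groups.items()}

# Hypothetical records: A and B trade twice, A and C trade once.
result = strength_by_degree([("A", "B"), ("A", "B"), ("A", "C")])
print(result)
```

An exponent such as the one reported in Fig. 4 would then be estimated from a fit of log S against log k over the resulting pairs.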

3.2. Small-world Effect

For complex networks, the average path length and the clustering coefficient are two important 
measures of small-world effect. 

In a network, the average path length L is defined as the number of edges in the shortest
path between two nodes, averaged over all pairs of nodes. It plays an important role in
transportation and communication in a network. For a network of n nodes,
$L = \frac{2}{n(n-1)} \sum_{i>j} d_{ij}$, where $d_{ij}$ is the shortest-path length between nodes i and j. The
longest shortest path among all pairs of nodes is called the diameter of the network.

The clustering coefficient $C_i$ of a node i is defined as the ratio of the total number $e_i$ of edges
that actually exist between all its $k_i$ immediate (nearest) neighbors to the number $k_i(k_i - 1)/2$
of all possible edges between them, that is, $C_i = 2e_i / [k_i(k_i - 1)]$. The clustering coefficient C
of the whole network is the average of $C_i$ over all nodes, i.e., $C = \frac{1}{n} \sum_i C_i$. The clustering
coefficient measures the probability that two neighbors of a node are connected and reveals the
local cliquishness of a typical neighborhood within a network.
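
The two measures defined above can be computed directly from an adjacency structure. The following sketch assumes a connected, undirected graph stored as a dict of neighbor sets, and it assigns a clustering coefficient of 0 to nodes with fewer than two neighbors, which is a common convention and not something stated in the text. The function names and the four-node toy graph are illustrative only.

```python
from collections import deque
from itertools import combinations

def clustering(adj, i):
    """Local clustering coefficient: 2 * e_i / (k_i * (k_i - 1))."""
    nbrs = adj[i]
    k = len(nbrs)
    if k < 2:
        return 0.0  # assumed convention for nodes with fewer than 2 neighbors
    e = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
    return 2.0 * e / (k * (k - 1))

def bfs_distances(adj, source):
    """Shortest-path lengths (in edges) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        v = queue.popleft()
        for w in adj[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return dist

def path_stats(adj):
    """Average path length L and diameter D of a connected graph."""
    n = len(adj)
    total, diameter = 0, 0
    for s in adj:
        dist = bfs_distances(adj, s)
        total += sum(dist.values())          # counts each pair twice overall
        diameter = max(diameter, max(dist.values()))
    return total / (n * (n - 1)), diameter

# Toy graph: a triangle 0-1-2 with a pendant node 3 attached to node 2.
adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
C = sum(clustering(adj, i) for i in adj) / len(adj)
L, D = path_stats(adj)
print(C, L, D)
```

Running one breadth-first search per node costs time proportional to the number of nodes times the number of edges, so for networks as large as those in Table 1 a sampled or parallel variant may be preferable.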

In recent empirical studies, many real systems show the small-world effect, which is characterized by two
crucial factors: the average path length and the diameter are relatively small despite the often large network
size, growing nearly logarithmically with the number of nodes, and the clustering coefficient
is larger than that of a reference random network having the same number of nodes and edges as
the real network.

We also observe the small-world effect in the four FTNs. On the one hand, the average path length
L and the diameter D are small. For the FTNs, L ranges from 2.307 to 2.642, while the values
are between 2.774 and 2.933 in the corresponding equivalent random networks generated by the
Erdős-Rényi model with the same parameters as each FTN. The diameters D of the FTNs are
5 or 6, compared with 3 or 4 in the corresponding random networks. On the other hand, these networks
are highly clustered. The clustering coefficients C are 0.0480, 0.0453, 0.0499 and 0.0370 for
FTN-all, FTN-met, FTN-rub and FTN-oil, respectively, and the value of each FTN is larger than
that of the corresponding random network (with values of 0.0016, 0.0011, 0.0022 and 0.0012,
respectively). These results are presented in Table 3.
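
The reference values for the random networks can be cross-checked with a short calculation. In an Erdős-Rényi random graph where each pair of nodes is linked independently with probability p, the expected clustering coefficient equals p, and matching the number of nodes N and edges E gives p = 2E / [N(N - 1)]. The sketch below applies this to the node and edge counts of Table 1; treating the reference networks as this independent-edge variant of the model is an assumption, and the function name er_reference is hypothetical.

```python
# (number of nodes N, number of edges E) taken from Table 1.
ftns = {
    "FTN-all": (100994, 8068676),
    "FTN-met": (75262, 3226119),
    "FTN-rub": (55828, 3364557),
    "FTN-oil": (52208, 1657268),
}

def er_reference(n, e):
    """Average degree and expected clustering of a matched random graph."""
    avg_degree = 2.0 * e / n          # <k> = 2E / N
    p = 2.0 * e / (n * (n - 1))       # edge probability = expected clustering
    return avg_degree, p

reference = {name: er_reference(n, e) for name, (n, e) in ftns.items()}
for name, (avg_degree, p) in reference.items():
    print(name, round(avg_degree, 1), round(p, 4))
```

The rounded outputs agree with the average degrees and the random-network clustering coefficients quoted in the text, which suggests that the reported reference values follow from N and E alone.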

Table 3. The small-world effect is seen in all four futures trading networks, with a relatively large clustering
coefficient C and a small average path length L, in contrast to the corresponding random networks generated by
the Erdős-Rényi model with the same parameters (total number of nodes N, total number of edges E and
average number of edges per node $\langle k \rangle$) as the futures trading networks.

Network    N         E          <k>      k_max    C         L        D
FTN-all    100994    8068676    159.8    36878    0.0480    2.470    6
random                                   218      0.0016    2.774    4
FTN-met    75262     3226119    85.7     19466    0.0453    2.642    6
random                                   133      0.0011    2.905    4
FTN-rub    55828     3364557    120.5    35694    0.0499    2.307    6
random                                   169      0.0022    2.766    3
FTN-oil    52208     1657268    63.5     23286    0.0370    2.494    5
random                                   102      0.0012    2.933

The ratio of each FTN's clustering coefficient to that of its corresponding random network
is relatively small in comparison with other real systems [4]. This implies that there is no adequate
evidence of local cliquishness of a typical neighborhood within each FTN, which may be due to
the randomness of counterparty matching during trade execution. Furthermore, the small L and
D are due to 1) the existence of hub nodes (consistent with the large maximum degree $k_{max}$ in
Table 3) that act as bridges between different nodes in the networks, and 2) new connections
generated between existing nodes previously without direct links, which provide more shortcuts
in the networks. The shortest path distribution in Fig. 5 shows that most of the shortest paths in
the four networks are 2 or 3 in length, which helps explain the small-world effect.

Fig. 5. The shortest path distributions P(b) of the four futures trading networks. Most of the
shortest paths of each network are 2 or 3 in length.



3.3. Hierarchical Organization 

To examine the hierarchical organization of an FTN, we check C(k), the average clustering
coefficient of all nodes with the same degree k. If C(k) follows a strict scaling law, then
hierarchical organization exists in the network. Some real networks, such as the World Wide
Web, the actor network and the Internet, display hierarchical topologies [53].

For each FTN, as indicated in Fig. 6, the main part of C(k) obeys a scaling law of

$C(k) \sim (k + b)^{-\beta}$ (2)

with similar exponents $\beta = 0.87 \pm 0.03$, which implies that the network has a hierarchical
architecture. Although the majority of traders, who make only a few deals, have few links (low
k), most of these links join hub nodes that connect to each other, resulting in a large C(k). The
high-k nodes are hub traders; their neighbors that are low-k nodes are seldom linked to each
other, leading to a smaller C(k). This implies that ordinary investors are part of clusters
with high cohesiveness and dense interlinking, and the hubs play a bridging role, connecting
many separate small communities into a complete network. To some extent, the hub
speculators' trading behaviors help the futures market flourish by improving market liquidity, and
consequently enhance the pricing function of the futures market.




Fig. 6. Clustering coefficient C(k) as a function $C(k) \sim (k + b)^{-\beta}$ of node degree k, with similar
exponents of 0.87 ± 0.03 for the four futures trading networks, which implies that each network has a
hierarchical architecture.
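
To make the quantity C(k) concrete, the sketch below groups local clustering coefficients by node degree and averages them. The graph representation (a dict of neighbor sets), the zero value assigned to nodes with fewer than two neighbors, the function name and the toy graph are assumptions for illustration.

```python
from collections import defaultdict
from itertools import combinations

def clustering_by_degree(adj):
    """Average local clustering coefficient C(k) for each degree k."""
    by_degree = defaultdict(list)
    for i, nbrs in adj.items():
        k = len(nbrs)
        if k < 2:
            c = 0.0  # assumed convention for nodes with fewer than 2 neighbors
        else:
            links = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
            c = 2.0 * links / (k * (k - 1))
        by_degree[k].append(c)
    return {k: sum(cs) / len(cs) for k, cs in by_degree.items()}

# Toy graph: hub 0 linked to nodes 1-4, plus one edge between nodes 1 and 2.
adj = {0: {1, 2, 3, 4}, 1: {0, 2}, 2: {0, 1}, 3: {0}, 4: {0}}
ck = clustering_by_degree(adj)
print(ck)
```

In this toy graph the hub has a much lower clustering coefficient than its degree-2 neighbors, which is the qualitative pattern a decreasing C(k) describes.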



3.4. Betweenness Distribution 

Betweenness is a measure of the centrality of a node in a network; it is also a measure of 
influence that a node has over the spread of information in the network [54]. 

The betweenness of a given node is defined as the sum, over all pairs of nodes in a network, of the fraction of
shortest paths between the pair that pass through the node. If there is more than one shortest path
between two nodes, each of them is counted in the total number of shortest paths. To be
precise, the definition is as follows:

$b(i) = \sum_{j \neq k} \frac{b_{jk}(i)}{b_{jk}}$ (3)

where $b_{jk}(i)$ is the number of geodesic paths from j to k containing i, while $b_{jk}$ is the total number
of geodesic paths linking j and k. In Fig. 7, we present the cumulative betweenness distribution
function $P_{cum}(b) = \sum_{b' > b} P(b')$ of nodes for the four FTNs. By examining the cumulative
betweenness distributions $P_{cum}(b)$ of the four FTNs with the method of Clauset et al. [47], as for the
degree distribution, we find that they follow a power-law function, i.e., $P_{cum}(b) \sim b^{-(\omega - 1)}$,
corresponding to the betweenness distribution $P(b) \sim b^{-\omega}$. The exponents $\omega$ for FTN-all, FTN-met,
FTN-rub and FTN-oil are 1.871, 1.850, 1.891 and 1.876, respectively. Furthermore, we also
perform a goodness-of-fit test for each FTN, and the results are shown in Table 4. The p-value of
the goodness-of-fit test for each power-law fit indicates that the betweenness distribution
of each network is consistent with a power-law hypothesis.
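
One standard way to evaluate Eq. (3) on an unweighted graph is Brandes' algorithm, sketched below. The text does not say which algorithm was used for the FTNs, so this is only an illustration; the function name, the choice to count each unordered pair of endpoints once, and the toy graphs are assumptions.

```python
from collections import deque

def betweenness(adj):
    """Node betweenness of an undirected, unweighted graph (Brandes' algorithm).

    adj: dict mapping each node to the set of its neighbors.
    Each unordered pair of endpoints is counted once.
    """
    b = {v: 0.0 for v in adj}
    for s in adj:
        # Breadth-first search from s, counting shortest paths (sigma).
        order = []
        preds = {v: [] for v in adj}
        sigma = {v: 0 for v in adj}
        dist = {v: -1 for v in adj}
        sigma[s], dist[s] = 1, 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Accumulate pair dependencies in order of decreasing distance from s.
        delta = {v: 0.0 for v in adj}
        while order:
            w = order.pop()
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1.0 + delta[w])
            if w != s:
                b[w] += delta[w]
    # Every unordered pair was visited from both endpoints, so halve the sums.
    return {v: val / 2.0 for v, val in b.items()}

path = {0: {1}, 1: {0, 2}, 2: {1}}
star = {0: {1, 2, 3}, 1: {0}, 2: {0}, 3: {0}}
b_path = betweenness(path)
b_star = betweenness(star)
print(b_path, b_star)
```

In the star graph, the center lies on the only shortest path between each of the three pairs of leaves, which is why its betweenness is 3.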




Fig. 7. The cumulative betweenness distributions of the four futures trading networks, which follow a
power-law function of $P_{cum}(b) \sim b^{-(\omega - 1)}$. The exponents $\omega$ for FTN-all, FTN-met, FTN-rub and FTN-oil
are 1.871, 1.850, 1.891 and 1.876, respectively.



Table 4. Estimates of the lower bound $b_{min}$ and the exponent $\omega$, and the goodness-of-fit tests of
betweenness power-law behaviors of the four futures trading networks. The p-values indicate that the
betweenness distribution of each network is consistent with a power-law hypothesis.

Network    $b_{min}$    $\omega$    p-value
FTN-all    103410.8     1.871       0.237
FTN-met    149845.9     1.850       0.437
FTN-rub    50322.7      1.891       0.334
FTN-oil    42603.8      1.876       0.278

Information communication (e.g., money or asset transfer) exists in FTNs, and money flow is
considered an index of financial markets. The power-law distribution demonstrates that the
nodes with high betweenness have the potential to control the information flow passing between
node pairs of the network. Accordingly, the participants with high betweenness centrality play
a decisive role in promoting market prosperity and enhancing financial functions.



3.5. Disassortative mixing 

In a network, the preference for a node to attach to others that are similar or different in
some way is called assortativity, which is often examined in terms of the degree correlation between
two nodes. Such a correlation is captured by two prominent measures: the assortativity
coefficient [55, 56] and the average nearest-neighbor degree [57].

The assortativity coefficient is essentially the Pearson correlation coefficient of degree between
a pair of linked nodes, and is defined as

$r = \frac{1}{\sigma_q^2} \sum_{jk} jk (e_{jk} - q_j q_k)$ (4)

where $q_k$ is the distribution of the remaining degree, i.e., the probability that a node has k other
edges, $q_j$ is the probability that a node has j other edges, $e_{jk}$ is the joint probability distribution
of the remaining degrees of the two nodes at either end of a randomly chosen edge, and $\sigma_q$ is the
standard deviation of the distribution $q_k$, with $\sigma_q^2 = \sum_k k^2 q_k - (\sum_k k q_k)^2$. The coefficient r lies in the
range $-1 \le r \le 1$. Positive r values indicate an assortative mixing pattern, a
preference for high-degree nodes to attach to other high-degree nodes, while negative r values
imply disassortative mixing, i.e., high-degree nodes connect, on average, to low-degree ones.
Zero r means no assortativity.

For practical evaluation, the assortativity coefficient can also be calculated by:

$r = \frac{M^{-1} \sum_i j_i k_i - [M^{-1} \sum_i \frac{1}{2}(j_i + k_i)]^2}{M^{-1} \sum_i \frac{1}{2}(j_i^2 + k_i^2) - [M^{-1} \sum_i \frac{1}{2}(j_i + k_i)]^2}$ (5)

where $j_i$ and $k_i$ are the degrees of the nodes at the ends of the ith edge, with i = 1, ..., M, and M denotes
the number of edges in the network. We calculate the assortativity coefficients of the four FTNs,
and obtain the values of r, which are -0.1522, -0.1590, -0.1594 and -0.1608 for FTN-all, FTN-met,
FTN-rub and FTN-oil, respectively. These negative values show that the
FTNs are disassortatively mixed.
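
The edge-based calculation of r can be sketched as follows, using the standard form of this estimator from the assortativity literature cited as [55, 56]: the mean over edges of the degree product, minus the squared mean of the end-point degrees, normalized by the variance of the end-point degrees. The function name and the two toy graphs are assumptions made for illustration.

```python
from collections import Counter

def assortativity(edges):
    """Degree assortativity coefficient computed from an undirected edge list.

    edges: list of (u, v) pairs, each undirected edge listed once.
    Note: the denominator is zero for regular graphs (all degrees equal).
    """
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    m = len(edges)
    # Averages over edges of j*k, (j + k)/2 and (j^2 + k^2)/2.
    mean_jk = sum(degree[u] * degree[v] for u, v in edges) / m
    mean_half = sum(0.5 * (degree[u] + degree[v]) for u, v in edges) / m
    mean_sq = sum(0.5 * (degree[u] ** 2 + degree[v] ** 2) for u, v in edges) / m
    return (mean_jk - mean_half ** 2) / (mean_sq - mean_half ** 2)

star_edges = [(0, 1), (0, 2), (0, 3)]   # one hub, three leaves
path_edges = [(0, 1), (1, 2), (2, 3)]   # path on four nodes
r_star = assortativity(star_edges)
r_path = assortativity(path_edges)
print(r_star, r_path)
```

A star graph is the extreme disassortative case, since every edge joins the highest-degree node to a lowest-degree node.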

Another means of measuring the degree correlation is to examine the property of neighbor
connectivity, i.e., the average nearest-neighbor degree $\langle k_{nn} \rangle$ of a node with degree k, which is
defined as

$\langle k_{nn} \rangle = \sum_{k'} k' P(k'|k)$ (6)

where $P(k'|k)$ is the conditional probability that an edge points from a node of degree k to a node
of degree k'. If $\langle k_{nn} \rangle$ increases with k, the network is assortative since it implies that the nodes
of high degree tend to connect to the nodes of high degree. Conversely, if $\langle k_{nn} \rangle$ decreases with
k, the network is disassortative, which means that nodes of high degree are more likely to have
nearest neighbors of lower degree. If $\langle k_{nn} \rangle$ is independent of k, there is no degree correlation
between connected nodes.

In Fig. 8, we plot the neighbor connectivity to depict the overall assortativity trends of the
four FTNs. We can see that the FTNs show a power-law dependence on the degree, $\langle k_{nn} \rangle \sim k^{\mu}$,
with similar exponents $\mu = -0.52 \pm 0.03$. The decrease of $\langle k_{nn} \rangle$ with k clearly indicates the
existence of disassortativity in the FTNs.
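
The neighbor connectivity can be estimated empirically by averaging, over all nodes of degree k, the mean degree of their neighbors. The sketch below assumes a dict-of-sets graph representation; the function name and the star-graph example are illustrative.

```python
from collections import defaultdict

def knn_by_degree(adj):
    """Average nearest-neighbor degree for each degree k."""
    by_degree = defaultdict(list)
    for i, nbrs in adj.items():
        if nbrs:
            # Mean degree of the neighbors of node i.
            mean_nbr_degree = sum(len(adj[j]) for j in nbrs) / len(nbrs)
            by_degree[len(nbrs)].append(mean_nbr_degree)
    return {k: sum(vals) / len(vals) for k, vals in by_degree.items()}

# Star graph: the hub (degree 3) only sees leaves, the leaves only see the hub.
star = {0: {1, 2, 3}, 1: {0}, 2: {0}, 3: {0}}
knn = knn_by_degree(star)
print(knn)
```

Here the value falls from 3 at k = 1 to 1 at k = 3, the decreasing trend that signals disassortative mixing.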

The disassortative mixing of FTNs shows that their assortativity is similar to that of
technological and biological networks, but different from that of social networks. The nontrivial
correlation property of the FTNs indicates that the active speculators with high degree do not
frequently trade with each other; on the contrary, they tend to make deals with seldom-trading
participants. This negative correlation suggests monitoring the liquidity risk of the futures market by
tracking the active speculators.




3.6. Power-Law Evolution 

Up to now we have explored topological properties of static FTNs. Here we consider
some dynamic characteristics of the FTN-all network by examining its evolution, which
can characterize the real trading behavior of the futures market.

Like many real-life systems [10, 12, 13], FTN-all also exhibits accelerated growth [58], as
shown in Fig. 9(a), meaning that the number of edges increases faster than the number
of nodes. Furthermore, the relationship between the number of edges and the number of nodes
in FTN-all follows a power-law function $e(t) \sim n(t)^{\alpha}$. Here, e(t) and n(t) respectively denote the
numbers of edges and nodes of the network at time t, and $\alpha$ is the exponent. Such a relationship
is termed the densification power law by Leskovec et al. [59]. We can see that the fitting curve
in Fig. 9(a) consists of two segments with different slopes, 1.8 and 3.3, respectively. We notice
that the breakpoint of the broken fitting line is located where the network size is
about 30000, and its corresponding time is exactly the end of the first trading day, which means
that the mechanism of edge growth in FTN-all during the first day is totally different from that of the
following days.

Besides, we observe that the average degree and density of FTN-all scale as power laws of
the network size, and both fitting curves are broken lines, as shown in Fig. 9(b) and (c). The changes in
both average degree and density over time are directly related to the change in the number
of edges. The average degree $\bar{k}(t)$ and density d(t) are respectively evaluated by $\bar{k}(t) = 2e(t)/n(t)$ and
$d(t) = 2e(t)/[n(t)(n(t) - 1)]$, and thus we obtain $\bar{k}(t) \sim n(t)^{\alpha - 1}$ and $d(t) \sim n(t)^{\alpha - 2}$. In Fig. 9(b), the
two slopes of the broken line are 0.8 and 2.3, respectively, and in Fig. 9(c) they are -0.2 and 1.3,
respectively, which explains why the broken line of the density first falls and then rises. According
to the empirical viewpoint of Leskovec et al. [59], the trend of the average degree
of FTN-all in Fig. 9(b) suggests that the network becomes denser as the network size increases.
However, from the viewpoint of network density based on the benchmark of the complete graph, the
curve in Fig. 9(c) implies that FTN-all first becomes sparser and then quickly turns denser as
the network size expands.
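
The relation between the three exponents can be checked numerically. The sketch below fits a least-squares slope in log-log coordinates, which is one simple way to estimate such exponents (the text does not state how the slopes in Fig. 9 were fitted), and applies it to synthetic data generated with an exponent of 1.8. The function name and the synthetic network sizes are assumptions.

```python
import math

def loglog_slope(xs, ys):
    """Least-squares slope of log(y) against log(x)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mx, my = sum(lx) / n, sum(ly) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(lx, ly))
    sxx = sum((a - mx) ** 2 for a in lx)
    return sxy / sxx

# Synthetic growth data with e = n^1.8 (not the FTN-all measurements).
nodes = [1000, 3000, 10000, 30000]
edges = [n ** 1.8 for n in nodes]
avg_degree = [2.0 * e / n for n, e in zip(nodes, edges)]
density = [2.0 * e / (n * (n - 1)) for n, e in zip(nodes, edges)]

slope_edges = loglog_slope(nodes, edges)         # densification exponent
slope_degree = loglog_slope(nodes, avg_degree)   # exponent minus 1
slope_density = loglog_slope(nodes, density)     # close to exponent minus 2
print(slope_edges, slope_degree, slope_density)
```

The density slope is only approximately the exponent minus 2 because the denominator is n(n - 1) and not n squared; the difference becomes negligible for large networks.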

Fig. 9(d) shows that the maximum degree $k_{max}(t)$ of FTN-all grows as a power law, $k_{max}(t) \sim n(t)^{\beta}$
with $\beta \approx 1.7$. This observation indicates that the active participants in the futures market
remain active throughout, and the day traders (those who frequently buy and sell futures within
the same trading day so that all positions will customarily be closed before the market closes on
that day) constitute the core part of the active participants.

In some real complex networks [1, 9], the average path length scales logarithmically with the
number of nodes, and the diameter grows slowly with the network size. However, in FTN-all
we notice that the average path length decreases as the number of nodes increases, and so does the
diameter. The average path length decays as a power law with a slope of -0.27 in Fig. 9(e),
while the diameter of FTN-all decreases as shown in Fig. 9(f). As the network size grows, the
simultaneous shrinkage of both the average path length and the diameter in FTN-all provides
further evidence of densification in the network.

The properties above indicate that the nodes in FTN-all become closer to each other
as the network size grows. These properties reflect the real trading behavior of the futures market.
In the futures market, deals come from two sources: trading by new investors who have just
entered the market, and transactions between existing speculators. Each new investor
makes a few deals for the sake of caution, while the existing speculators, especially the day
traders, make a large number of transactions. The latter contribute many deals between
participants who previously had no trading relationship, which results in new links between
nodes that previously had no direct connections in FTN-all, and consequently accounts for
the properties mentioned above. Additionally, the different dominant sources of edge addition
during network expansion cause the different slopes of the two fitting lines before
and after the switching point, which occurs when the network size is about 30000 in
each logarithmic plot of Fig. 9(a)-(c) and corresponds exactly to the end of the first trading
day. On the first trading day, almost all investors are newcomers, so the edge growth originating
from new investors is the dominant factor. After that day, the number of new investors and their
transactions tend to remain steady, while the edges generated among the existing participants
increase rapidly and outnumber those generated by the new investors. The switch between the
driving factors of edge growth explains the existence of two different exponents in the power law
underlying the network's evolution.




Fig. 9. (a) The number of edges versus the number of nodes in log-log scale for FTN-all; power laws
with slopes of 1.8 and 3.3 are observed in the evolution process. The breakpoint is at the end
of the first trading day. (b) The average degree grows as a power law with slopes of 0.8
and 2.3, respectively. (c) The density changes over time as an approximate power law with slopes of -0.2
and 1.3, which account for the observation that the curve first falls and then rises. The switch in the
driving factors of edge growth results in the different slopes in the power law underlying the network's evolution.
(d) The maximum degree grows as a power law with regard to the network size, and the slope is 1.7. (e) The
average path length decays as a power law with a slope of -0.27. (f) The diameter decreases as the number
of nodes increases.

4. Conclusion and Future Work



In this paper, futures trading networks have been extensively analyzed from the perspective of
complex networks. We constructed FTNs by using real trading records covering three months of
operation of the Shanghai Futures Exchange. We found a number of interesting statistical
properties of the networks, including scale-free behavior, small-world effect, hierarchical
organization, power-law betweenness distribution, disassortative mixing and power-law evolution. Some
unique features of the FTNs are possibly due to the randomness of counterparty matching during
deal execution and the continuous generation of new links between previously unconnected
nodes, which play an important role in determining the structures of futures trading networks.
Although the three smaller networks (sub-networks) are derived from a partition of the whole market
dataset, all their statistical properties are consistent with those of the whole market network.

As only undirected graphs are considered in this paper, it is worth further studying directed
FTNs based on trading direction, such as from buyer to seller, which reflects the money flow in
the futures markets. In Section 3.1, we used weighted FTNs only to demonstrate
that active market participants get more active, which leaves much unexplored. We could construct
weighted FTNs or weighted and directed FTNs to investigate the market behaviors of active
participants in more depth. Moreover, the generation model of FTNs is also an important issue for further
exploration, as it can provide valuable insight into financial trade monitoring and risk control.